Knowledge Acquisition from Amino Acid Sequences by Machine Learning System BONSAI
Authors
Abstract
We present a machine learning system, called BONSAI, for knowledge acquisition from positive and negative examples of strings, and report some experiments on protein data using the PIR and GenBank databases. The system is built on an algorithmic learning theory for decision trees over regular patterns, which was newly developed for this work. As a hypothesis, the system tries to find a pair consisting of a classification of symbols, called an alphabet indexing, and a decision tree over regular patterns that classifies the given examples with high accuracy. In the experiments, the system discovered very simple hypotheses that exhibit important knowledge about transmembrane domains and signal peptides.
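The shape of such a hypothesis can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the two-symbol indexing (hydrophobic residues mapped to "1", all others to "0"), the restriction of patterns to simple substring tests, the example tree, and the names `index_sequence`, `Node`, `Leaf`, and `classify`. The sketch shows only how a fixed hypothesis would classify a sequence, not how BONSAI searches for one.

```python
from dataclasses import dataclass
from typing import Union

# Hypothetical alphabet indexing: collapse the 20 amino acids to two symbols.
# The choice of residues is an illustrative assumption, not BONSAI's result.
HYDROPHOBIC = set("AVLIMFWC")


def index_sequence(seq: str) -> str:
    """Rewrite an amino acid sequence over the reduced alphabet {'0', '1'}."""
    return "".join("1" if aa in HYDROPHOBIC else "0" for aa in seq.upper())


@dataclass
class Leaf:
    label: bool  # True = positive class, False = negative class


@dataclass
class Node:
    substring: str          # pattern test: does the indexed sequence contain this?
    if_match: "Tree"
    if_no_match: "Tree"


Tree = Union[Leaf, Node]


def classify(tree: Tree, seq: str) -> bool:
    """Index the sequence, then walk the tree until a leaf is reached."""
    indexed = index_sequence(seq)
    node = tree
    while isinstance(node, Node):
        node = node.if_match if node.substring in indexed else node.if_no_match
    return node.label


# Hypothetical two-level tree: a long non-hydrophobic run means negative;
# otherwise a run of three hydrophobic residues means positive.
EXAMPLE_TREE = Node("0000", Leaf(False), Node("111", Leaf(True), Leaf(False)))

if __name__ == "__main__":
    for s in ("LLLAVIL", "KKDEERLLL", "GLG"):
        print(s, index_sequence(s), classify(EXAMPLE_TREE, s))
```

Reducing the alphabet before matching keeps the patterns at each node short, which is one plausible reading of why the hypotheses reported above are described as very simple.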
Similar resources
BONSAI Garden: Parallel Knowledge Discovery System for Amino Acid Sequences
We have developed a machine discovery system BONSAI which receives positive and negative examples as inputs and produces as a hypothesis a pair of a decision tree over regular patterns and an alphabet indexing. This system has succeeded in discovering reasonable knowledge on transmembrane domain sequences and signal peptide sequences by computer experiments. However, when several kinds of seque...
How to predict it: Inductive Prediction by Analogy Using Taxonomic Information
This paper presents a novel machine learning technique in a logic programming environment: Inductive Prediction by Analogy (IPA). IPA learns the description of a target predicate similar to a source predicate from examples of the target predicate. A key feature of IPA is that it uses analogies to constrain the space of hypotheses using taxonomic information represented by first-order predicate logic...
Discovery of Functional Components of Proteins from Amino-acid sequences based on Rough Sets and Hierarchical Reasoning
Protein structure analysis from DNA sequences is an important and fast-growing area in both computer science and biochemistry. Although interesting approaches have been studied, it is very difficult to capture the characteristics of proteins, since even a simple protein is made of more than 100 amino acids, which makes it very difficult for biochemical experiments to detect functional components. For ...
Investigating the Effect of Implementing a Suggestion System on Organizational Learning
The purpose of this paper is to study the causal effect of implementing a suggestion system on three major aspects of organizational learning: knowledge acquisition, knowledge distribution, and knowledge utilization. Since organizational learning is becoming more important in the world of knowledge-based businesses, managers are more interested in knowing which management systems & proce...
Alphabet Indexing by Cluster Analysis: A Method for Knowledge Acquisition from Amino Acid Sequences
Knowledge acquisition has been an important topic in Artificial Intelligence, and a variety of contributions have been made in various fields where computers can be applied. Genome Informatics is one of the most attractive fields, for which knowledge acquisition techniques are strongly expected. In [3] a knowledge acquisition system for sequence data has been developed and has shown successful experime...
Journal title:
Volume, issue
Pages -
Publication date: 2007